Quantum algorithm for online convex optimization
نویسندگان
چکیده
Abstract We explore whether quantum advantages can be found for the zeroth-order online convex optimization problem, which is also known as bandit with multi-point feedback. In this setting, given access to oracles (that is, loss function accessed a black box that returns value any queried input), player attempts minimize sequence of adversarially generated functions. This procedure described $T$ round iterative game between and adversary. paper, we present algorithms problem show first time potential are possible problems optimization. Specifically, our contributions follows. (i) When allowed query $O(1)$ times in each feedback, give algorithm achieves $O(\sqrt{T})$ regret without additional dependence dimension $n$, outperforms already optimal classical only achieving $O(\sqrt{nT})$ regret. Note has achieved lower bound first-order methods. (ii) strongly functions, achieve $O(\log T)$ queries well, means same full information setting.
منابع مشابه
Adaptive Bound Optimization for Online Convex Optimization
We introduce a new online convex optimization algorithm that adaptively chooses its regularization function based on the loss functions observed so far. This is in contrast to previous algorithms that use a fixed regularization function such as L2-squared, and modify it only via a single time-dependent parameter. Our algorithm’s regret bounds are worst-case optimal, and for certain realistic cl...
متن کاملOnline Convex Optimization
A convex repeated game is a two players game that is performed in a sequence of consecutive rounds. On round t of the repeated game, the first player chooses a vector wt from a convex set A. Next, the second player responds with a convex function gt : A → R. Finally, the first player suffers an instantaneous loss gt(wt). We study the game from the viewpoint of the first player. In offline conve...
متن کاملOnline convex optimization
1.1 Definitions We say a set S ⊆ Rd is convex if for any two points x,x′ ∈ S, the line segment conv{x,x′} := {(1−α)x+αx′ : α ∈ [0, 1]} between x and x′ (also called the convex hull of {x,x′}) is contained in S. Overloading terms, we say a function f : S → R is convex if its epigraph epi(f) := {(x, t) ∈ S × R : f(x) ≤ t} is a convex set (in Rd × R). Proposition 1. A function f : S → R is convex ...
متن کاملOnline Learning and Online Convex Optimization
Online learning is a well established learning paradigm which has both theoretical and practical appeals. The goal of online learning is to make a sequence of accurate predictions given knowledge of the correct answer to previous prediction tasks and possibly additional available information. Online learning has been studied in several research fields including game theory, information theory, ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Quantum science and technology
سال: 2022
ISSN: ['2364-9054', '2364-9062']
DOI: https://doi.org/10.1088/2058-9565/ac5919